Corpus: ltz-lu_web_2020_30K

4.4.1.5 Number of Word-N-grams at Sentence Endings

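Number of distinct word N-grams (N = 1 to 5) at the ends of the first K sentences. The rows for K = 100000 and K = 1000000 are identical, which suggests that the corpus contains fewer than 100000 sentences.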

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          89            99             99            99            99
   1000         773           979            997           997           998
  10000        5076          9028           9806          9894          9923
 100000       11930         25327          28967         29525         29692
1000000       11930         25327          28967         29525         29692
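
The counts in the table could be reproduced along the following lines. This is a minimal sketch, not the tool that generated this page: it assumes that the figures are numbers of distinct N-grams, that sentences are tokenized on whitespace, and that a sentence shorter than N words contributes no N-gram. The function name ending_ngram_counts and its parameters are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def ending_ngram_counts(sentences: Iterable[str], k: int, max_n: int = 5) -> List[int]:
    """Count distinct word n-grams (n = 1..max_n) that end the first k sentences."""
    # One set of n-gram tuples per value of n.
    seen: List[Set[Tuple[str, ...]]] = [set() for _ in range(max_n)]
    for i, sentence in enumerate(sentences):
        if i >= k:
            break
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # assumption: too-short sentences are skipped
                seen[n - 1].add(tuple(tokens[-n:]))
    return [len(s) for s in seen]
```

Calling ending_ngram_counts(sentences, k) for k = 100, 1000, 10000, ... would give one table row per call. Once k exceeds the number of sentences available, the result stops changing.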


Zipf's diagram for sentence endings

[Figure: Gnuplot diagram]
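
A Zipf diagram plots frequency against rank, usually on logarithmic axes. The data points for such a plot could be derived as sketched below. This is an illustration only: it assumes whitespace tokenization and treats the last token of each sentence as its ending, and the function name ending_rank_frequency is hypothetical. The settings of the original plot are not known.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def ending_rank_frequency(sentences: Iterable[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for the words that end the sentences."""
    endings = []
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if tokens:  # skip empty sentences
            endings.append(tokens[-1])
    # Sort frequencies in descending order; rank 1 is the most frequent ending.
    freqs = sorted(Counter(endings).values(), reverse=True)
    return list(enumerate(freqs, start=1))
```

Writing each pair as a "rank frequency" line gives a two-column data file that Gnuplot can draw with logarithmic scales on both axes.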

3130 msec needed at 2021-09-17 03:02